AITopics | ultimate optimizer

Collaborating Authors

ultimate optimizer

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Gradient Descent: The Ultimate Optimizer

Neural Information Processing SystemsDec-24-2025, 00:47:37 GMT

gradient descent, name change, ultimate optimizer, (5 more...)

Neural Information Processing Systems

Country: North America > United States > Massachusetts > Middlesex County > Cambridge (0.08)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.41)

Add feedback

Gradient Descent: The Ultimate Optimizer

Neural Information Processing SystemsOct-10-2024, 15:25:03 GMT

Working with any gradient-based machine learning algorithm involves the tedious task of tuning the optimizer's hyperparameters, such as its step size. Recent work has shown how the step size can itself be optimized alongside the model parameters by manually deriving expressions for "hypergradients" ahead of time.We show how to automatically compute hypergradients with a simple and elegant modification to backpropagation. This allows us to easily apply the method to other optimizers and hyperparameters (e.g. We can even recursively apply the method to its own hyper-hyperparameters, and so on ad infinitum. As these towers of optimizers grow taller, they become less sensitive to the initial choice of hyperparameters.

gradient descent, hyperparameter, ultimate optimizer, (3 more...)

Neural Information Processing Systems

Country: North America > United States > Massachusetts > Middlesex County > Cambridge (0.10)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.45)

Add feedback

SCoTTi: Save Computation at Training Time with an adaptive framework

Lin, Ziyu, Tartaglione, Enzo, Nguyen, Van-Tam

arXiv.org Artificial IntelligenceDec-19-2023

On-device training is an emerging approach in machine learning where models are trained on edge devices, aiming to enhance privacy protection and real-time performance. However, edge devices typically possess restricted computational power and resources, making it challenging to perform computationally intensive model training tasks. Consequently, reducing resource consumption during training has become a pressing concern in this field. To this end, we propose SCoTTi (Save Computation at Training Time), an adaptive framework that addresses the aforementioned challenge. It leverages an optimizable threshold parameter to effectively reduce the number of neuron updates during training which corresponds to a decrease in memory and computation footprint. Our proposed approach demonstrates superior performance compared to the state-of-the-art methods regarding computational resource savings on various commonly employed benchmarks and popular architectures, including ResNets, MobileNet, and Swin-T.

neuron, optimizer, scotti, (16 more...)

arXiv.org Artificial Intelligence

doi: 10.1109/ICCVW60793.2023.00156

2312.12483

Country:

North America > United States > California > San Diego County > San Diego (0.04)
Europe > Slovakia > Bratislava > Bratislava (0.04)
Europe > France > Île-de-France > Paris > Paris (0.04)

Genre:

Research Report > New Finding (0.46)
Research Report > Promising Solution (0.34)

Industry: Information Technology > Security & Privacy (0.48)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

Add feedback

#NeurIPS2022 outstanding paper – Gradient descent: the ultimate optimizer

AIHubNov-30-2022, 10:45:06 GMT

Kartik Chandra, Audrey Xie, Jonathan Ragan-Kelley and Erik Meijer won a NeurIPS 2022 outstanding paper award for their work Gradient descent: the ultimate optimizer. Here, they tell us more about their work, the methodology and their main findings. Our paper studies the classic problem of "hyperparameter optimization". Nearly all of today's machine learning algorithms use a process called "stochastic gradient descent" (SGD) to train neural networks. SGD requires users to pick certain settings, or "hyperparameters," before running it.

gradient descent, hyperparameter, ultimate optimizer, (6 more...)

AIHub

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (1.00)

Add feedback

Gradient Descent: The Ultimate Optimizer

Chandra, Kartik, Meijer, Erik, Andow, Samantha, Arroyo-Fang, Emilio, Dea, Irene, George, Johann, Grueter, Melissa, Hosmer, Basil, Stumpos, Steffi, Tempest, Alanna, Yang, Shannon

arXiv.org Machine LearningSep-29-2019

Working with any gradient-based machine learning algorithm involves the tedious task of tuning the optimizer's hyperparameters, such as the learning rate. There exist many techniques for automated hyperparameter optimization, but they typically introduce even more hyperparameters to control the hyperparameter optimization process. We propose to instead learn the hyperparameters themselves by gradient descent, and furthermore to learn the hyper-hyperparameters by gradient descent as well, and so on ad infinitum. As these towers of gradient-based optimizers grow, they become significantly less sensitive to the choice of top-level hyperparameters, hence decreasing the burden on the user to search for optimal values.

hyperparameter, optimizer, sgd, (13 more...)

arXiv.org Machine Learning

1909.13371

Country:

North America > United States > California > Santa Clara County > Palo Alto (0.14)
Oceania > Australia > New South Wales > Sydney (0.04)
North America > United States > California > San Mateo County > Menlo Park (0.04)
Europe > Spain > Canary Islands (0.04)

Genre: Research Report (0.82)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (1.00)

Add feedback